Picture for Daizong Liu

Daizong Liu

Wangxuan Institute of Computer Technology, Peking University

Annotations Are Not All You Need: A Cross-modal Knowledge Transfer Network for Unsupervised Temporal Sentence Grounding

Add code
May 29, 2026
Viaarxiv icon

Not All Inputs Are Valid: Towards Open-Set Video Moment Retrieval Using Language

Add code
May 28, 2026
Viaarxiv icon

Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using Language

Add code
May 28, 2026
Viaarxiv icon

Towards Unified Vision-Language Models with Incomplete Multi-Modal Inputs

Add code
May 27, 2026
Viaarxiv icon

Rethinking Video-Language Model from the Language Input Perspective

Add code
May 27, 2026
Viaarxiv icon

Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective

Add code
May 26, 2026
Viaarxiv icon

AutoResearch AI: Towards AI-Powered Research Automation for Scientific Discovery

Add code
May 22, 2026
Viaarxiv icon

Frequency-Domain Regularized Adversarial Alignment for Transferable Attacks against Closed-Source MLLMs

Add code
May 20, 2026
Viaarxiv icon

From Part to Whole: 3D Generative World Model with an Adaptive Structural Hierarchy

Add code
Mar 23, 2026
Viaarxiv icon

Rethinking Transferable Adversarial Attacks on Point Clouds from a Compact Subspace Perspective

Add code
Jan 30, 2026
Viaarxiv icon